07. Dataset Schema Analysis
Dataset Schema Analysis
ND320 AIHCND C01 L01 A07 Dataset Schema Analysis
Key Things to Consider:
- Identify the predictor
- Identify categorical, numerical features
- Work with SMEs and Domain Experts
- Domain knowledge is key to representing data correctly
You can find this dataset in the notebook on this page to inspect the dataset as well as use the link below to get more specific information for the different categories.
Code
If you need a code on the https://github.com/udacity.
Data Schema Quiz 1
QUIZ QUESTION::
Match what is True or False when analyzing the schema of a dataset?
ANSWER CHOICES:
Statement |
True/False |
---|---|
Identify correlations between features |
|
Domain expertise is not needed if you utilize recursive feature elimination. |
|
Identify the predictor attribute/feature. |
|
Identify possible categorical and numerical features. |
SOLUTION:
Statement |
True/False |
---|---|
Identify the predictor attribute/feature. |
|
Identify possible categorical and numerical features. |
|
Identify correlations between features |
|
Domain expertise is not needed if you utilize recursive feature elimination. |
|
Identify correlations between features |
|
Domain expertise is not needed if you utilize recursive feature elimination. |
|
Identify the predictor attribute/feature. |
|
Identify possible categorical and numerical features. |
|
Identify the predictor attribute/feature. |
|
Identify possible categorical and numerical features. |
|
Identify correlations between features |
|
Domain expertise is not needed if you utilize recursive feature elimination. |
Data Schema Quiz
QUIZ QUESTION::
Part - 1: Which features are categorical and which are numerical? Please use the Notebook workspace above and the UCI Heart Disease Dataset site to help you with this quiz.
ANSWER CHOICES:
Feature |
Type |
---|---|
slope |
|
thalach |
|
thal |
|
cp |
|
trestbps |
SOLUTION:
Feature |
Type |
---|---|
slope |
|
thal |
|
cp |
|
thalach |
|
trestbps |
|
thalach |
|
trestbps |
|
slope |
|
thal |
|
cp |
|
slope |
|
thal |
|
cp |
Data Schema Quiz
QUIZ QUESTION::
Part - 2: Which features are categorical and which are numerical? Please use the Notebook workspace above and the UCI Heart Disease Dataset site to help you with this quiz.
ANSWER CHOICES:
Feature |
Type |
---|---|
fbs |
|
ca |
|
oldpeak |
|
sex |
SOLUTION:
Feature |
Type |
---|---|
fbs |
|
ca |
|
sex |
|
fbs |
|
ca |
|
sex |
|
fbs |
|
ca |
|
sex |
|
oldpeak |